


The Skewes number for twin primes:
counting sign changes of $\pi_2(x) - C_2 \mathrm{Li}_2(x)$



Marek Wolf 

Group of Mathematical Methods in Physics 

University of Wroclaw 

Pl.Maxa Borna 9, PL-50-204 Wroclaw, Poland 



e-mail: mwolf@ift.uni.wroc.pl


Abstract

The results of a computer investigation of the sign changes of the difference between the number of twin primes $\pi_2(x)$ and the Hardy-Littlewood conjecture $C_2 \mathrm{Li}_2(x)$ are reported. It turns out that $d_2(x) = \pi_2(x) - C_2 \mathrm{Li}_2(x)$ changes sign at unexpectedly low values of $x$, and for $x < 2^{48} = 2.81\ldots \times 10^{14}$ there are 477118 sign changes of this difference. It is conjectured that the number of sign changes of $d_2(x)$ for $x \in (1, T)$ is given by $\sqrt{T}/\log(T)$. The running logarithmic densities of the sets for which $d_2(x) > 0$ and $d_2(x) < 0$ are plotted for $x$ up to $2^{48}$.

Let $\pi(x)$ be the number of primes smaller than $x$ and let $\mathrm{Li}(x)$ denote the logarithmic integral:

$$\mathrm{Li}(x) = \int_2^x \frac{du}{\log(u)}. \tag{1}$$

The Prime Number Theorem tells us that $\mathrm{Li}(x)/\pi(x)$ tends to 1 for $x \to \infty$, and the available data (see [24, Table 14, p. 175] or [8, Table 5 and 6]) show that always $\mathrm{Li}(x) > \pi(x)$. This experimental observation was the reason for the common belief in the past that the inequality $\mathrm{Li}(x) > \pi(x)$ is generally valid. However, in 1914 J.E. Littlewood showed [20] (see also [7]) that the difference between the number of primes smaller than $x$ and the logarithmic integral up to $x$ changes sign infinitely many times. The smallest value $x_S$ such that for the first time $\pi(x_S) > \mathrm{Li}(x_S)$ holds is called the Skewes number. We have used ">" to avoid the case of an integer value of $\mathrm{Li}(x_S)$, although we believe that for $n \in \mathbb{N}$ there will be $\mathrm{Li}(n) \notin \mathbb{N}$, just as $\log(n)$ is known to be irrational for every integer $n > 1$. In 1933 S. Skewes [29], assuming the truth of the Riemann hypothesis, argued that it is certain that $d(x) := \pi(x) - \mathrm{Li}(x)$ changes sign for some $x_S < 10^{10^{10^{34}}}$. In 1955 Skewes [30] found, without assuming the Riemann hypothesis, that $d(x)$ changes sign at some

$$x_S < \exp\exp\exp\exp(7.705) < 10^{10^{10^{964}}}.$$

This enormous bound for $x_S$ was reduced by Cohen and Mayhew [5] to $x_S < 10^{10^{529.7}}$ without using the Riemann hypothesis. In 1966 Lehman [19] showed that between $1.53 \times 10^{1165}$ and $1.65 \times 10^{1165}$ there are more than $10^{500}$ successive integers $x$ for which $\pi(x) > \mathrm{Li}(x)$. Following the method of Lehman, in 1987 H.J.J. te Riele [31] showed that between $6.62 \times 10^{370}$ and $6.69 \times 10^{370}$ there are more than $10^{180}$ successive integers $x$ for which $d(x) > 0$. The lowest present-day estimate of the Skewes number is around $10^{316}$, see [2] and [27].

The number of sign changes of the difference $d(x)$ for $x$ in a given interval $(1, T)$, which is commonly denoted by $\nu(T)$, see [7], was discussed for the first time by A.E. Ingham in 1935 [12, chapter V], [11] and next by S. Knapowski [16]. Regarding the number of sign changes of $d(x)$ in the interval $(1, T)$, Knapowski [16] proved that

$$\nu(T) \geq e^{-35} \log\log\log\log T \tag{2}$$



provided $T \geq \exp\exp\exp\exp(35)$. Further results about $\nu(T)$ were obtained by J. Pintz [21], [22] and J. Kaczorowski [13], [14]. In particular, in [14] Kaczorowski proved that there exists a positive constant $C_3$ such that for sufficiently large $T$ the inequality

$$\nu(T) \geq C_3 \log(T) \tag{3}$$

holds. In [28] J.-C. Schlage-Puchta proved, assuming the Riemann Hypothesis, that

.(T)>'^-1. (4) 

More general results on the sign changes can be found in the recent paper [15].

In this paper we will look for the analog of the Skewes number for the twin primes, 
i.e. pairs of primes separated by 2: {(3,5), (5,7), (11,13), . . . , (59, 61), . . . }. 

Let us denote the number of twin prime pairs $(p, p+2)$ with $p + 2 < x$ by $\pi_2(x)$. Then the unproved (see however [25]) conjecture B of Hardy and Littlewood [9] on the number of prime pairs $p, p+d$, applied to the case $d = 2$, gives

$$\pi_2(x) \sim C_2 \mathrm{Li}_2(x) = C_2 \int_2^x \frac{du}{\log^2(u)}, \tag{5}$$

where $C_2$ is called the "twin constant" and is defined by the following infinite product over odd primes:

$$C_2 = 2 \prod_{p > 2} \left(1 - \frac{1}{(p-1)^2}\right) = 1.3203236316937\ldots \tag{6}$$
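
The product (6) converges quickly enough to be checked directly. The following is a minimal sketch, not part of the original computation: it truncates the product at a hypothetical cutoff `limit` and uses a plain sieve of Eratosthenes. The function names are illustrative only.

```python
import math

def odd_primes_up_to(limit):
    """Sieve of Eratosthenes; returns the odd primes not exceeding limit."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(limit) + 1):
        if sieve[i]:
            # Cross out the multiples of i starting from i*i.
            sieve[i * i :: i] = bytearray(len(range(i * i, limit + 1, i)))
    return [p for p in range(3, limit + 1, 2) if sieve[p]]

def twin_constant_partial(limit):
    """Partial product 2 * prod_{2 < p <= limit} (1 - 1/(p-1)^2)."""
    product = 2.0
    for p in odd_primes_up_to(limit):
        product *= 1.0 - 1.0 / (p - 1) ** 2
    return product

if __name__ == "__main__":
    for limit in (10**2, 10**4, 10**6):
        print(limit, twin_constant_partial(limit))
```

Every omitted factor is smaller than 1, so the partial products decrease towards $C_2$ from above; the neglected tail is roughly of size $1/(N \log N)$ for a cutoff $N$.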

For the first time the conjecture (5) was checked computationally up to $8 \times 10^{10}$ by R. P. Brent [4], who noticed the sign changes of the difference $\pi_2(x) - C_2 \mathrm{Li}_2(x)$, but he neither mentioned the analogy with the Skewes number nor counted these sign changes. We analyzed the difference $d_2(x) := \pi_2(x) - C_2 \mathrm{Li}_2(x)$ by computer for $x$ up to $T = 2^{48} \approx 2.814 \times 10^{14}$. It took 195 CPU days to reach $T = 2^{48}$ on a 64-bit AMD Opteron 2700 MHz processor.

To calculate the integral $\mathrm{Li}_2(x)$ during the main run of the program up to $2^{48}$ we used the 10-point Gauss quadrature [23]. This integral was calculated numerically in successive intervals between consecutive twins and added to the previous value. Such a method is not very time consuming, and the number of arithmetical operations performed per interval does not depend on $x$. There are also power series representations of the logarithmic integral. We use the following convention for $\mathrm{li}(x)$ (here v.p. stands for the French valeur principale, i.e. the Cauchy principal value):

$$\mathrm{li}(x) = \mathrm{v.p.} \int_0^x \frac{du}{\log(u)}, \tag{7}$$

hence we have $\mathrm{Li}(x) = \mathrm{li}(x) - \mathrm{li}(2)$. Integration by parts gives the asymptotic expansion:

$$\mathrm{li}(x) \sim \frac{x}{\log(x)} + \frac{x}{\log^2(x)} + \frac{2x}{\log^3(x)} + \frac{6x}{\log^4(x)} + \cdots + \frac{n!\,x}{\log^{n+1}(x)} + \cdots, \tag{8}$$

which should be cut at $n_0 = \lfloor \log(x) \rfloor$, because beginning with this index the following terms are increasing. There is a quickly convergent series giving $\mathrm{li}(x)$ for all $x > 1$, which has $n!$ in the denominator and $\log^n(x)$ in the numerator, the opposite of the arrangement in (8) (see [3, p. 126, Entry 14]):

$$\int_\mu^x \frac{du}{\log(u)} = \gamma + \log\log(x) + \sum_{n=1}^{\infty} \frac{\log^n(x)}{n \cdot n!} \quad \text{for } x > 1, \tag{9}$$

where $\gamma = 0.5772156649\ldots$ is the Euler-Mascheroni constant and $\mu = 1.451369234883381\ldots$ is the Soldner constant defined by (see [3, p. 123, eq. (11.3)])

$$\mathrm{li}(\mu) = \mathrm{v.p.} \int_0^\mu \frac{du}{\log(u)} = 0.$$

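An even faster converging series was discovered by Ramanujan [3, p. 130, Entry 16]:

$$\int_\mu^x \frac{du}{\log(u)} = \gamma + \log\log(x) + \sqrt{x} \sum_{n=1}^{\infty} \frac{(-1)^{n-1} \log^n(x)}{n!\, 2^{n-1}} \sum_{k=0}^{\lfloor (n-1)/2 \rfloor} \frac{1}{2k+1}. \tag{10}$$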
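
The two series for $\mathrm{li}(x)$ can be compared numerically. The sketch below is illustrative only and is not the program used for the results reported here. It assumes the standard forms of the two expansions: $\gamma + \log\log x + \sum_{n \ge 1} \log^n(x)/(n \cdot n!)$ for the series (9), and Ramanujan's series with the factor $\sqrt{x}$ and the inner sum over $1/(2k+1)$ for (10). The fixed truncation `terms=200` is an assumption that is ample for $x$ up to $2^{48}$.

```python
import math

EULER_GAMMA = 0.5772156649015329

def li_series(x, terms=200):
    """li(x) = gamma + log log x + sum_{n>=1} log^n(x) / (n * n!), x > 1."""
    L = math.log(x)
    total = EULER_GAMMA + math.log(L)
    term = 1.0  # holds L^n / n!
    for n in range(1, terms + 1):
        term *= L / n
        total += term / n
    return total

def li_ramanujan(x, terms=200):
    """li(x) from Ramanujan's series, x > 1."""
    L = math.log(x)
    coeff = L        # (-1)^(n-1) L^n / (n! 2^(n-1)), here for n = 1
    odd_sum = 0.0    # sum_{k=0}^{floor((n-1)/2)} 1/(2k+1)
    total = 0.0
    for n in range(1, terms + 1):
        if n > 1:
            coeff *= -L / (2 * n)
        if n % 2 == 1:
            odd_sum += 1.0 / n   # a new term 1/(2k+1) enters when n is odd
        total += coeff * odd_sum
    return EULER_GAMMA + math.log(L) + math.sqrt(x) * total

if __name__ == "__main__":
    for x in (2.0, 1.0e3, 1.0e6):
        print(x, li_series(x), li_ramanujan(x))
```

Both functions should vanish, up to rounding, at the Soldner constant $\mu$, which gives a quick consistency check of the definition of $\mu$.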

Because integration by parts gives

$$\mathrm{Li}_2(x) = \mathrm{Li}(x) - \frac{x}{\log(x)} + \frac{2}{\log(2)},$$

it is possible to calculate values of $\mathrm{Li}_2(x)$ using the above series. The disadvantage of these series is that the number of operations (including the time-consuming calculation of $\log(x)$) increases with $x$ and is larger than the number of operations needed in the numerical integration.
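
The two ways of obtaining $\mathrm{Li}_2(x)$ can be cross-checked in a few lines. This is a hedged sketch rather than the original program: the 10-point Gauss-Legendre abscissas and weights are the standard tabulated values, and the integration grid is a geometric one with a hypothetical ratio 1.5 (an assumption made for brevity; the computation described above used the intervals between consecutive twins instead). The series route uses $\mathrm{Li}_2(x) = \mathrm{li}(x) - \mathrm{li}(2) - x/\log(x) + 2/\log(2)$ with $\mathrm{li}$ from the series with $n \cdot n!$ in the denominator.

```python
import math

EULER_GAMMA = 0.5772156649015329

# Positive abscissas and weights of the 10-point Gauss-Legendre rule on [-1, 1].
NODES = (0.1488743389816312, 0.4333953941292472, 0.6794095682990244,
         0.8650633666889845, 0.9739065285171717)
WEIGHTS = (0.2955242247147529, 0.2692667193099963, 0.2190863625159820,
           0.1494513491505806, 0.0666713443086881)

def gauss10(f, a, b):
    """10-point Gauss-Legendre approximation of the integral of f over [a, b]."""
    mid, half = 0.5 * (a + b), 0.5 * (b - a)
    acc = 0.0
    for t, w in zip(NODES, WEIGHTS):
        acc += w * (f(mid + half * t) + f(mid - half * t))
    return half * acc

def li2_quadrature(x, ratio=1.5):
    """Li_2(x) = int_2^x du / log^2(u), accumulated interval by interval."""
    total, a = 0.0, 2.0
    while a < x:
        b = min(a * ratio, x)
        total += gauss10(lambda u: 1.0 / math.log(u) ** 2, a, b)
        a = b
    return total

def li(x, terms=200):
    """li(x) = gamma + log log x + sum_{n>=1} log^n(x) / (n * n!)."""
    L = math.log(x)
    total, term = EULER_GAMMA + math.log(L), 1.0
    for n in range(1, terms + 1):
        term *= L / n
        total += term / n
    return total

def li2_series(x):
    """Li_2(x) = li(x) - li(2) - x/log(x) + 2/log(2)."""
    return li(x) - li(2.0) - x / math.log(x) + 2.0 / math.log(2.0)

if __name__ == "__main__":
    for x in (1.0e3, 1.0e6):
        print(x, li2_quadrature(x), li2_series(x))
```

In this sketch the quadrature costs a fixed 10 evaluations of the integrand per interval, whereas the number of series terms needed grows with $\log(x)$, which is the trade-off described in the text.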

As for the set of all primes, initially the inequality $C_2 \mathrm{Li}_2(x) > \pi_2(x)$ holds, but it turns out that there are surprisingly many sign changes of $d_2(x) = \pi_2(x) - C_2 \mathrm{Li}_2(x)$ for $x$ in the interval $(1, 2^{48})$. The first sign change of $d_2(x)$ appears at the twin pair (1369391, 1369393), and up to $T = 2^{48}$ there are 477118 sign changes of $d_2(x)$. We have collected the positions of all these sign changes in one file, which is available for download from http://www.ift.uni.wroc.pl/~mwolf/Skewesy_twins.zip. Let $\nu_2(T)$ denote, by analogy with usual primes, the number of sign changes of $d_2(x)$ in the interval $(1, T)$. Table I contains the recorded number of sign changes of $\pi_2(x) - C_2 \mathrm{Li}_2(x)$ up to $T = 2^{21}, 2^{22}, \ldots, 2^{48}$. We have checked the numbers $\nu_2(T)$ up to $T = 2^{34} = 1.718 \times 10^{10}$ independently by calculating the integral $\mathrm{Li}_2(x)$ from the series (10), and these results are presented in the third column of Table I and are marked with an asterisk. The first 1274 positions of sign



TABLE I

The number of sign changes of $d_2(x)$

T      $\nu_2(T)$   $\nu_2(T)$(*)   $\sqrt{T}/\log(T)$      T      $\nu_2(T)$   $\sqrt{T}/\log(T)$
2^21       29          29              99               2^35     12682         7641
2^22       29          29             134               2^36     23634        10505
2^23       29          29             182               2^37     31641        14455
2^24       29          29             246               2^38     31641        19905
2^25       29          29             334               2^39     31641        27428
2^26      238         238             455               2^40     38899        37819
2^27      854         854             619               2^41     55106        52180
2^28     1226        1226             844               2^42     90355        72037
2^29     1226        1226            1153               2^43    161031        99506
2^30     1226        1226            1576               2^44    161031       137525
2^31     1226        1226            2157               2^45    161031       190168
2^32     2854        2852            2955               2^46    405289       263091
2^33     7383        7381            4052               2^47    472000       364151
2^34     9115        9113            5562               2^48    477118       504258


changes of $d_2(x)$ obtained by these two methods of calculating the integral $\mathrm{Li}_2(x)$ were the same. The first difference between the two methods appears at the twin pairs (3067608611, 3067608613) and (3067609091, 3067609093), which were not detected using the more accurate formula (10). The next sign changes detected by the two methods are the same until the twin pair (7809444029, 7809444031). In general, among over 9100 sign changes up to $2^{34}$ there were 17 differences in the positions of sign changes of $d_2(x)$ obtained by the two methods of calculating the integral $\mathrm{Li}_2(x)$.
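
The search for the first sign change is small enough to be repeated on a desktop machine. The sketch below is an illustration under stated assumptions, not the original program: it evaluates $\mathrm{Li}_2$ through $\mathrm{li}(x) - \mathrm{li}(2) - x/\log(x) + 2/\log(2)$ with $\mathrm{li}$ taken from the series $\gamma + \log\log x + \sum \log^n(x)/(n \cdot n!)$, counts a pair $(p, p+2)$ at $x = p + 2$, and uses a hypothetical search limit of $1.5 \times 10^6$. According to the text, the search should stop at the pair (1369391, 1369393).

```python
import math

C2 = 1.3203236316937          # twin constant, digits as quoted in the text
EULER_GAMMA = 0.5772156649015329

def li(x, terms=100):
    """li(x) = gamma + log log x + sum_{n>=1} log^n(x) / (n * n!)."""
    L = math.log(x)
    total, term = EULER_GAMMA + math.log(L), 1.0
    for n in range(1, terms + 1):
        term *= L / n
        total += term / n
    return total

LI_AT_2 = li(2.0)

def Li2(x):
    """Li_2(x) = int_2^x du / log^2(u), via integration by parts."""
    return li(x) - LI_AT_2 - x / math.log(x) + 2.0 / math.log(2.0)

def first_sign_change(limit):
    """First twin pair (p, p+2) <= limit at which d_2 becomes positive."""
    sieve = bytearray([1]) * (limit + 1)
    sieve[0:2] = b"\x00\x00"
    for i in range(2, math.isqrt(limit) + 1):
        if sieve[i]:
            sieve[i * i :: i] = bytearray(len(range(i * i, limit + 1, i)))
    pairs = 0
    for p in range(3, limit - 1, 2):
        if sieve[p] and sieve[p + 2]:
            pairs += 1                      # pi_2 jumps by one at x = p + 2
            if pairs - C2 * Li2(p + 2) > 0:
                return (p, p + 2)
    return None

result = first_sign_change(1_500_000)
print(result)
```

Between twins $d_2(x)$ decreases continuously, so the first change from negative to positive values can only happen at a jump of $\pi_2$, which is why the sketch tests $d_2$ only at the twin pairs.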

The values of T searched by the direct checking are of small magnitude from the point 
of view of mathematics, but large for modern computers. 

The observed numbers $\nu_2(T)$ behave somewhat erratically, see Fig. 1; in particular there are large gaps without any change of sign of $d_2(x)$. If one assumes a power-like dependence of $\nu_2(T)$, then a least-squares fit gives the function $aT^b$, where $a = 0.2723\ldots$ and $b = 0.4389\ldots$. Instead of such accidental-looking parameters of the pure power-like dependence we suggest the function $\sqrt{T}/\log(T)$ as an approximation to $\nu_2(T)$: it is a more natural function, without any free parameters, and it takes values very close to the least-squares fit $aT^b$, see Figure 1. Thus we state the following conjecture:

$$\nu_2(T) \sim \sqrt{T}/\log(T). \tag{11}$$

We picked the function $\sqrt{T}/\log(T)$ after a few trials and we are not able to give even heuristic arguments in favour of it. The conjecture (11) is supported by the fact that there are 10 crossings of the curve $\sqrt{T}/\log(T)$ with the staircase-like plot of $\nu_2(T)$ obtained directly from the computer data. The last column in Table I contains the values of the function $\sqrt{T}/\log(T)$. If the conjecture (11) is true, then there are infinitely many twins.
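
The last column of Table I is easy to regenerate. The following sketch evaluates $\sqrt{T}/\log(T)$ at $T = 2^k$ and prints it next to a few of the observed counts copied from Table I; the dictionary `OBSERVED` is only a small excerpt chosen for illustration.

```python
import math

def conjectured_count(k):
    """sqrt(T) / log(T) evaluated at T = 2**k."""
    T = 2.0 ** k
    return math.sqrt(T) / math.log(T)

# A few observed values of nu_2(2**k), copied from Table I.
OBSERVED = {21: 29, 28: 1226, 35: 12682, 42: 90355, 48: 477118}

if __name__ == "__main__":
    for k, count in OBSERVED.items():
        print(f"2^{k}: observed {count}, sqrt(T)/log(T) = {conjectured_count(k):.0f}")
```

The ratio of the observed to the conjectured value fluctuates on both sides of 1 over this range, in line with the crossings of the two curves mentioned above.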

It seems to be very difficult to gain analytical insight into why there are so many sign changes of $\pi_2(x) - C_2 \mathrm{Li}_2(x)$. As (5) is not proved, the error term for it is also not known (for a heuristic approximate formula for averages of the remainders in the Hardy-Littlewood conjecture B see [18]). The best error term for the Prime Number Theorem under the Riemann Hypothesis is $|\pi(x) - \mathrm{Li}(x)| = O(\sqrt{x} \log(x))$. In Fig. 2 we present the computer data for two functions: the running difference $\mathrm{Li}(x) - \pi(x) = -d(x)$ and the error term:

$$\Delta(x) = \max_{2 \le t \le x} |\pi(t) - \mathrm{Li}(t)|. \tag{12}$$

Characteristic oscillations of $d(x)$ are fully described by the explicit formula for $\pi(x)$, see e.g. [HI formula (3) and Figure 4]. In Fig. 3, $|d_2(x)|$ and the error term

$$\Delta_2(x) = \max_{2 \le t \le x} |\pi_2(t) - C_2 \mathrm{Li}_2(t)| \tag{13}$$

are plotted for $x < 2^{48}$. As seen from these figures, the behavior of $d(x)$ and $d_2(x)$ is completely different, with rapid oscillations of $d_2(x)$ over many orders of magnitude. However, the functions $\Delta(x)$ and $\Delta_2(x)$ are quite similar: the error term for twins $\Delta_2(x)$ is smaller than $\Delta(x)$, but the difference is not significant. The power-like fits to $\Delta(x)$ and $\Delta_2(x)$ give:

$$\alpha x^{\beta}, \quad \alpha = 0.209\ldots, \quad \beta = 0.45\ldots \quad \text{for } \Delta(x), \tag{14}$$

$$\alpha_2 x^{\beta_2}, \quad \alpha_2 = 0.337\ldots, \quad \beta_2 = 0.418\ldots \quad \text{for } \Delta_2(x). \tag{15}$$

Here the slopes $\beta$ and $\beta_2$, as well as the prefactors $\alpha$ and $\alpha_2$, are very close. Thus it seems that the sizes of the error terms do not account for the enormous difference in the value of the Skewes number. In fact all considerations of Skewes, Kaczorowski and others were based on the existence of explicit formulas, and there are no analogs of explicit formulas for twins. However, Turán [33] introduced the following Dirichlet series with the aim of studying twins:

$$T(s) = \sum_{n > 3} \frac{\Lambda(n-1)\Lambda(n+1)}{n^s} \quad (\mathrm{Re}\, s > 1), \tag{16}$$

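where $\Lambda(n)$ is the von Mangoldt function:

$$\Lambda(n) = \begin{cases} 0 & \text{if } n = 1, \\ \log p & \text{if } n = p^k \text{ for some prime } p \text{ and integer } k \ge 1, \\ 0 & \text{if } n \text{ has at least two different prime factors.} \end{cases} \tag{17}$$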
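
To make the definition of $\Lambda(n)$ concrete, here is a minimal sketch that implements the von Mangoldt function by trial division and forms a partial sum of the Dirichlet series $\sum_{n>3} \Lambda(n-1)\Lambda(n+1)/n^s$ for real $s > 1$. The function names and the cutoff `N` are hypothetical; the partial sum is only an illustration and says nothing about the analytic continuation discussed below.

```python
import math

def von_mangoldt(n):
    """Lambda(n): log p if n is a power of a prime p, otherwise 0."""
    if n < 2:
        return 0.0
    for p in range(2, math.isqrt(n) + 1):
        if n % p == 0:
            # p is the smallest prime factor; strip all copies of it.
            while n % p == 0:
                n //= p
            return math.log(p) if n == 1 else 0.0
    return math.log(n)  # no divisor up to sqrt(n): n itself is prime

def turan_partial_sum(s, N):
    """Partial sum over 3 < n <= N of Lambda(n-1) * Lambda(n+1) / n**s."""
    return sum(von_mangoldt(n - 1) * von_mangoldt(n + 1) / n ** s
               for n in range(4, N + 1))

if __name__ == "__main__":
    print([round(von_mangoldt(n), 4) for n in range(1, 11)])
    print(turan_partial_sum(2.0, 10_000))
```

Only those $n$ for which both $n-1$ and $n+1$ are prime powers contribute, so apart from a sparse set of terms involving higher prime powers (for example $n = 8$, with $7$ and $9 = 3^2$) the series is supported on the middles of twin pairs.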



In 2004, in a preprint [1], Arenstorf attempted to prove that there are infinitely many twins. Arenstorf tried to continue $T(s) - C_2/(s - 1)$ analytically to $\mathrm{Re}\, s = 1$, but shortly afterwards an error in the proof was pointed out by Tenenbaum [32]. For recent progress towards a proof of the infinitude of twins see [17].

The comparison of Figures 2 and 3 shows that $\pi_2(x) \sim C_2 \mathrm{Li}_2(x)$ is better than $\pi(x) \sim \mathrm{Li}(x)$ in the sense that there are almost half a million points where $d_2(x)$ crosses zero in Fig. 3, while in Fig. 2 there are no crossings of the $x$ axis at all. This observation can be quantified with the notion of logarithmic density. In [26] it was proposed to use the logarithmic density to measure the different biases in the distribution of prime numbers. In particular, for the case of the sign changes of $d(x)$ it was shown that the logarithmic density of the set $\{x : \mathrm{Li}(x) < \pi(x)\}$, defined by

$$\delta_{\{x:\, \mathrm{Li}(x) < \pi(x)\}} = \lim_{x \to \infty} \frac{1}{\log x} \sum_{\substack{n \le x \\ \mathrm{Li}(n) < \pi(n)}} \frac{1}{n}, \tag{18}$$

is equal to $\delta_{\{x:\, \mathrm{Li}(x) < \pi(x)\}} = 2.7\ldots \times 10^{-7}$. Hence in some precisely defined sense the inequality $\mathrm{Li}(x) > \pi(x)$ holds almost everywhere. Here we will define two logarithmic densities for twin primes as follows:



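$$\delta_+ = \lim_{x \to \infty} \frac{1}{\log(x)} \sum_{\substack{2 \le n \le x \\ d_2(n) > 0}} \frac{1}{n}, \tag{19}$$

$$\delta_- = \lim_{x \to \infty} \frac{1}{\log(x)} \sum_{\substack{2 \le n \le x \\ d_2(n) < 0}} \frac{1}{n}. \tag{20}$$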
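
As a small worked illustration of (19) and (20) without the limit, the sketch below computes the running densities at $x = 100$ by brute force. It is not the procedure used for Figure 4, which relied on the stored positions of the sign changes. Assumptions of the sketch: the sums start at $n = 5$ (the end of the first twin pair), a pair $(p, p+2)$ is counted from $n = p + 2$ on, and $\mathrm{Li}_2$ is evaluated as $\mathrm{li}(n) - \mathrm{li}(2) - n/\log(n) + 2/\log(2)$ with $\mathrm{li}$ from the series $\gamma + \log\log x + \sum \log^k(x)/(k \cdot k!)$.

```python
import math

C2 = 1.3203236316937
EULER_GAMMA = 0.5772156649015329

def li(x, terms=100):
    """li(x) = gamma + log log x + sum_{k>=1} log^k(x) / (k * k!)."""
    L = math.log(x)
    total, term = EULER_GAMMA + math.log(L), 1.0
    for k in range(1, terms + 1):
        term *= L / k
        total += term / k
    return total

def Li2(x):
    return li(x) - li(2.0) - x / math.log(x) + 2.0 / math.log(2.0)

def is_prime(n):
    if n < 2:
        return False
    return all(n % q for q in range(2, math.isqrt(n) + 1))

def running_log_densities(x_max):
    """Finite-size approximations (delta_plus, delta_minus) at x = x_max."""
    plus = minus = 0.0
    pairs = 0
    for n in range(5, x_max + 1):
        if is_prime(n) and is_prime(n - 2):
            pairs += 1                     # the pair (n - 2, n) ends at n
        d2 = pairs - C2 * Li2(n)
        if d2 > 0:
            plus += 1.0 / n
        elif d2 < 0:
            minus += 1.0 / n
    norm = math.log(x_max)
    return plus / norm, minus / norm

delta_plus, delta_minus = running_log_densities(100)
print(delta_plus, delta_minus)
```

Since $d_2(n) < 0$ throughout this range, the whole harmonic sum from $1/5$ to $1/100$ goes into the second density, giving roughly 0.67 at $x = 100$.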

We do not have at our disposal any formulas like those in [26], and we have to turn to brute-force numerical calculation of the finite-size approximations $\delta_+(x)$ and $\delta_-(x)$ given by expressions (19) and (20) without the limit operation $\lim_{x \to \infty}$. In these computations we used the positions of all sign changes collected earlier. The resulting running logarithmic densities are plotted in Figure 4. The sum for $\delta_-(x)$ starts from 1/5, because 5 is the end of the first twin prime pair. This is the reason why the plot of $\delta_-(x)$ in Fig. 4 starts from about 0.67. Up to $x = 2^{31}$ the data for Figure 4 was obtained by direct summation of the harmonic sums; for $x > 2^{31} \approx 2.15 \times 10^{9}$ the incredibly accurate approximation [6, pp. 76-78]:

$$\sum_{k=m}^{n} \frac{1}{k} = \log\left(n + \tfrac{1}{2}\right) - \log\left(m - \tfrac{1}{2}\right) + O\left(\frac{1}{m^2}\right) \tag{21}$$

was used (the constant implied in $O$ is much smaller than 1). For $n \sim 10^{9}$ the error made by using the above formula is of the order 10^^^. To calculate the harmonic series up to $x = 2.8 \times 10^{14}$ directly by adding all numbers $1/n$ would take from one to a few months of CPU time, depending on the processor. The plots presented in Fig. 4 suggest the following conjecture:

$$\delta_+ = \delta_- = \frac{1}{2}. \tag{22}$$
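
The accuracy of the harmonic-sum approximation (21) can be checked on a small range. The sketch below reads (21) as $\sum_{k=m}^{n} 1/k \approx \log(n + 1/2) - \log(m - 1/2)$; this reading of the formula, as well as the test range $m = 1000$, $n = 100000$, is an assumption of the example.

```python
import math

def harmonic_direct(m, n):
    """Sum of 1/k for k = m..n, accumulated with compensated summation."""
    return math.fsum(1.0 / k for k in range(m, n + 1))

def harmonic_approx(m, n):
    """log(n + 1/2) - log(m - 1/2), the closed-form approximation."""
    return math.log(n + 0.5) - math.log(m - 0.5)

m, n = 1000, 100_000
error = harmonic_direct(m, n) - harmonic_approx(m, n)
print(error, error * m * m)
```

The scaled error `error * m * m` stays well below 1 in absolute value, consistent with the remark that the implied constant is much smaller than 1, while the closed form needs two logarithms instead of $n - m + 1$ divisions.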



The difference of many hundreds of orders of magnitude between the values of $x$ at which $\pi(x) - \mathrm{Li}(x)$ and $\pi_2(x) - C_2 \mathrm{Li}_2(x)$ change sign for the first time is astonishing. We can give an example from physics. Let us make the mapping: sign changes of $d(x)$ correspond to energy levels of hydrogen and sign changes of $d_2(x)$ correspond to the spectrum of helium. Then the ground states of hydrogen and of helium will correspond to $x_S$ and the first sign change of $d_2(x)$, respectively. Experiments show that the energies of the ground states of hydrogen and helium are -13.6 eV and -79 eV respectively and do not differ by hundreds of orders of magnitude!

Fig. 1 The plot comparing the actual values of $\nu_2(T)$ found by a computer search with the conjecture (11). There are 10 crossings of the function $\nu_2(T)$ and $\sqrt{T}/\log(T)$ in this plot up to $2^{48}$. All 477118 sign changes of $d_2(x)$ are plotted. The inset presents the plot on a double logarithmic scale.

Fig. 2 The plot of $d(x)$ and the error term $\Delta(x)$. The power fit was made for 10^ < x < 2^^. The first crossing of the $x$ axis will appear around $10^{316}$.

Fig. 3 The plot of $|d_2(x)|$ and the error term $\Delta_2(x)$. Sign changes of $d_2(x)$ and values smaller than 10^^ were artificially set to 10~^. In blue the power-like fit $0.337 \times x^{0.418}$ to $\Delta_2(x)$ obtained by the least-squares method is plotted.

Fig. 4 The plots of the running logarithmic densities $\delta_+(x)$, $\delta_-(x)$ defined in the text. Each plot consists of 28025 points: the values of the $\delta(x)$'s were recorded at the progression $x = 100 \times (1.001)^n$.



